The properties of protein family space depend on experimental design

نویسندگان

  • Victor Kunin
  • Sarah A. Teichmann
  • Martijn A. Huynen
  • Christos A. Ouzounis
چکیده

MOTIVATION Databases of protein families often exhibit drastically different properties of the protein family space. RESULTS We compared the properties of protein family space as reflected by exhaustive protein family databases and databases with predefined families. We used TRIBES, Protomap, ProDom and COGs as representatives of the exhaustive databases, and Pfam-A and Superfamily as databases that predefine families. We observe a power-law distribution of family sizes in all these databases, albeit in predefined databases the power-law line collapses before reaching smaller sized families. We discuss the future trends of this power-law distribution and suggest that saturation in the sampling of protein family space will result in a distortion of the power law in small family sizes. For larger genome sizes, predefined databases show logarithmic growth of the number of families per genome, whereas exhaustive databases exhibit a virtually linear relationship. All databases consistently differ in the proportion of protein families shared between taxa. Predefined databases have a larger number of protein families shared between the three domains of life, while exhaustive databases show a much more fragmented distribution. We argue that these discrepancies reflect alternative approaches to the trade-off issue of sensitivity versus specificity in the detection of homologous proteins. We conclude that these properties are complementary rather than contradictory, while describing the protein universe from different perspectives.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Prediction of Engineered Cementitious Composite Material Properties Using Artificial Neural Network

Cement-based composite materials like Engineered Cementitious Composites (ECCs) are applicable in the strengthening of structures because of the high tensile strength and strain. Proper mix proportion, which has the best mechanical properties, is so essential in ECC design material to use in structural components. In this paper, after finding the best mix proportion based on uniaxial tensile st...

متن کامل

Modeling and Optimization of Mechanical Properties of PA6/NBR/Graphene Nanocomposite Using Central Composite Design

Thermoplastic elastomer of PA6/NBR reinforced by various nanoparticles have wide application in many industries. The properties of these materials depend on PA6, NBR, and nanoparticle amount and characteristics. In this study, the simultaneous effect of NBR and graphene nanoparticle content on mechanical, thermal properties, and morphology of PA6/NBR/Graphene nanocomposites investigated by Cent...

متن کامل

Numerical Modeling and Experimental Study of Probe-Fed Rectangular Dielectric Resonator Antenna (RDRA) Supported by Finite Circular Ground Plane

Dielectric Resonator Antennas (DRAs) have received increased interest in recent years for their potential applications in microwave and millimeter wave communication systems. DRAs are normally used with the support of a ground plane. The radiation and impedance properties therefore depend not only on their physical dimensions and dielectric properties, but also on the size of the ground plane. ...

متن کامل

The Effect of Adding Different Levels of Valine in Low Protein Diets on Performance, Blood Parameters and Tibial Bone Properties of Ross-308 Broiler Chickens from 8-21 Days

Extende Abstract Introduction and Objective: The present research was performed to evaluate the effect of different levels of valine in low protein diets on performance, blood parameters and tibia bone properties of Ross-308 broiler chickens from 8-21 days. Material and Methods: For this purpose, a total of 200 one-day-old male broilers of Ross-308 strain from 8 to 21 days of age in 4 treatm...

متن کامل

Formulation and optimization of a new cationic lipid-modified PLGA nanoparticle as delivery system for Mycobacterium tuberculosis HspX/EsxS fusion protein: An experimental design

Polymeric particles and liposomes are efficient tools to overcome the low immunogenicity of subunit vaccines. The aim of the present study was formulation and optimization of a new cationic lipid-modified PLGA nanoparticles (NPs) as a delivery system for Mycobacterium tuberculosis HspX/EsxS fusion protein. The cationic lipid-modified PLGA NPs containing HspX/EsxS fusion protein were prepared us...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 21 11  شماره 

صفحات  -

تاریخ انتشار 2005